Optimal Worker Quality and Answer Estimates in Crowd-Powered Filtering and Rating
نویسندگان
چکیده
We consider the problem of optimally filtering (or rating) a set of items based on predicates (or scoring) requiring human evaluation. Filtering and rating are ubiquitous problems across crowdsourcing applications. We consider the setting where we are given a set of items and a set of worker responses for each item: yes/no in the case of filtering and an integer value in the case of rating. We assume that items have a true inherent value that is unknown, and workers draw their responses from a common, but hidden, error distribution. Our goal is to simultaneously assign a ground truth to the item-set and estimate the worker error distribution. Previous work in this area (Raykar and Yu; Whitehill et al.) has focused on heuristics such as Expectation Maximization (EM), providing only a local optima guarantee, while we have developed a general framework that finds a maximum likelihood solution. Our approach extends to a number of variations on the filtering and rating problems.
منابع مشابه
Optimal Crowd-Powered Rating and Filtering Algorithms
We focus on crowd-powered ltering, i.e., ltering a large set of items using humans. Filtering is one of the most commonly used building blocks in crowdsourcing applications and systems. While solutions for crowd-powered ltering exist, theymake a range of implicit assumptions and restrictions, ultimately rendering them not powerful enough for real-world applications. We describe two approache...
متن کاملGlobally Optimal Crowdsourcing Quality Management
We study crowdsourcing quality management, that is, given worker responses to a set of tasks, our goal is to jointly estimate the true answers for the tasks, as well as the quality of the workers. Prior work on this problem relies primarily on applying ExpectationMaximization (EM) on the underlying maximum likelihood problem to estimate true answers as well as worker quality. Unfortunately, EM ...
متن کاملArgonaut: Macrotask Crowdsourcing for Complex Data Processing
Crowdsourced workflows are used in research and industry to solve a variety of tasks. The databases community has used crowd workers in query operators/optimization and for tasks such as entity resolution. Such research utilizes microtasks where crowd workers are asked to answer simple yes/no or multiple choice questions with little training. Typically, microtasks are used with voting algorithm...
متن کاملOptimization techniques for human computation-enabled data processing systems
Crowdsourced labor markets make it possible to recruit large numbers of people to complete small tasks that are difficult to automate on computers. These marketplaces are increasingly widely used, with projections of over $1 billion being transferred between crowd employers and crowd workers by the end of 2012. While crowdsourcing enables forms of computation that artificial intelligence has no...
متن کاملTuning the Diversity of Open-Ended Responses From the Crowd
Crowdsourcing can solve problems beyond the reach of state-of-the-art fully automated systems (Bigham et al. 2010; Lasecki et al. 2011; 2012; Bernstein et al. 2011; von Ahn and Dabbish 2004; Attenberg, Ipeirotis, and Provost 2011; Aral, Ipeirotis, and Taylor 2011). A common pattern found in many such systems is for the workers to discover, in parallel, a number of candidate solutions and then v...
متن کامل